Project-Team:CQFD

Project-Team Cqfd

Members

Overall Objectives

Presentation

Research Program

Application Domains

Dependability and safety

New Software and Platforms

New Results

Highlights of the Year
Approximate Kalman–Bucy filter for continuous-time semi-Markov jump linear systems
Modeling and optimization of a launcher integration process
Numerical approximation for optimal stopping of MDP under partial observation
Classification of EEG signals by evolutionary algorithm
Probabilistic low-rank matrix completion with adaptive spectral regularization algorithms
Variable selection to construct indicators of quality of life for data structured in groups
Efficiency of simulation in monotone hyper-stable queueing networks
Control of parallel non-observable queues: asymptotic equivalence and optimality of periodic policies
The economics of the cloud: price competition and congestion
Generalized Nash Equilibria for Platform-as-a-Service Clouds
Stochastic approximations of constrained discounted Markov decision processes
Non-Parametric Estimation of the Conditional Distribution of the Interjumping Times for Piecewise-Deterministic Markov Processes
Approximation of average cost Markov decision processes using empirical distributions and concentration inequalities
Piecewise Deterministic Markov Processes based approach applied to an offshore oil production system
Optimal Trajectories for Underwater Vehicles by Quantization and Stochastic control
Multi-Objective Design and Maintenance Optimization of the Heated Hold-Up Tank Modeled by Piecewise Deterministic Markov Processes
Conditional quantile estimation through optimal quantization
Conditional quantile estimator based on optimal quantization: from theory to practice
QuantifQuantile : an R package for performing quantile regression trough optimal quantization
Transcriptome profile analysis reveals specific signatures of pollutants in Atlantic eels
Comparaison of kernel density estimators with assumption on number of modes : application on environmental monitoring data
A new sliced inverse regression method for multivariate response
An introduction to dimension reduction in nonparametric kernel regression
Hidden Markov Model for the detection of a degraded operating mode of optronic equipment
On the asymptotic behavior of the Nadaraya-Watson estimator associated with the recursive SIR method
Evolving Genetic Programming Classifiers with Novelty Search
Detecting mental states of alertness with genetic algorithm variable selection
A comparison of fitness-case sampling methods for Symbolic Regression
Geometric Semantic Genetic Programming with Local Search

Bilateral Contracts and Grants with Industry

Partnerships and Cooperations

Dissemination

Bibliography

Inria | Raweb 2014 | Presentation of the Project-Team CQFD


	PDF	e-Pub

previous

Home | Next next

next

Section: New Results

Stochastic approximations of constrained discounted Markov decision processes

Participants : Francois Dufour, Tomas Prieto-Rumeau.

We consider a discrete-time constrained Markov decision process under the discounted cost optimality criterion. The state and action spaces are assumed to be Borel spaces, while the cost and constraint functions might be unbounded. We are interested in approximating numerically the optimal discounted constrained cost. To this end, we suppose that the transition kernel of the Markov decision process is absolutely continuous with respect to some probability measure $μ$ . Then, by solving the linear programming formulation of a constrained control problem related to the empirical probability measure $μ_{n}$ of $μ$ , we obtain the corresponding approximation of the optimal constrained cost. We derive a concentration inequality which gives bounds on the probability that the estimation error is larger than some given constant. This bound is shown to decrease exponentially in $n$ . Our theoretical results are illustrated with a numerical application based on a stochastic version of the Beverton-Holt population model. This work has been published in Journal of Mathematical Analysis and applications: [27] .

previous

Home | Next next

next